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We search for the optimal quantum pure states of identical bosonic particles for applications 
in quantum metrology, in particular in the estimation of a single parameter for the generic two¬ 
mode interferometric setup. We consider the general case in which the total number of particles is 
fluctuating around an average N with variance AN^. By recasting the problem in the framework 
of classical probability, we clarify the maximal accuracy attainable and show that it is always larger 
than the one reachable with a fixed number of particles (i.e., AN = 0). In particular, for larger 
fluctuations, the error in the estimation diminishes proportionally to 1/AN, below the Heisenberg- 
like scaling 1/N. We also clarify the best input state, which is a “quasi-NOON state” for a generic 
setup, and for some special cases a two-mode “Schrodinger-cat state” with a vacuum component. In 
addition, we search for the best state within the class of pure Gaussian states with a given average 
N, which is revealed to be a product state (with no entanglement) with a squeezed vacuum in one 
mode and the vacuum in the other. 


PACS numbers: 03.65.Wj, 03.65.Ta, 42.50.St, 06.20.Dk 


I. INTRODUCTION 

By making use of quantum-mechanical features, such 
as quantum superposition, entanglement, or squeezing, 
we are able to go beyond classical technologies. One of 
such promising ideas is “quantum metrology” because of 
its possible applications [l-IO]. When one wishes to es¬ 
timate some quantity or parameter of a physical system, 
one typically tries to do it by analyzing the data col¬ 
lected by performing a number of independent and iden¬ 
tical experiments, or by sending a number of independent 
probes to the target. The error in the estimation scales 
as 1/y/lV (shot noise or standard quantum limit) and di¬ 
minishes as the number of probes N increases. On the 
other hand, it has been recognized that such scaling can 
be beaten by quantum-mechanical effects. In particular, 
the possibility of estimating phase shifts at the “Heisen¬ 
berg limit,” with errors scaling as 1/N, has been revealed 
with interferometric setups, which exploit the possibility 
of employing quantum correlations in the input states of 
the probes (the optimal choice being identified with the 
so-called “NOON states”) [II, 12]. 

Analogous results hold also when the total number of 
probes is not exactly fixed at some value but is allowed to 
fluctuate around an average number N. Such situations 
are often found in real experiments, e.g., when measuring 
an optical phase difference in a two-port Mach-Zehnder 
interferometer [13-21]. Also in this case, one may rec¬ 
ognize the existence of a I/VN scaling associated with 
the accuracy attainable, when employing classical light 
sources (say coherent states) as probing signals. As in 


the fixed-number scenario, this threshold can be over¬ 
come by properly employing probes exhibiting quantum 
characters (say squeezing and/or entanglement). In this 
case, however, the formal equivalence to the Heisenberg 
limit appears to be not as fundamental as in the fixed- 
number configuration: due to the presence of large fluc¬ 
tuations in the number of probes, violations of the 1/N 
scaling of the optimal accuracy are indeed possible. To 
get realistic results, extra constraints have to be imposed, 
either on the structure of the input signals [17-19], or on 
the amount of squeezing allowed in a single experiment 
[13], or finally on the fluctuations of the total number of 
particles involved in the experiment [16, 20]. 

In this article, we explore the quantum metrology with 
bosonic particles (e.g., photons) used as probes, for the 
most generic two-mode interferometric setup (where the 
total number of probe particles is preserved). We con¬ 
sider the case where the number of bosons is not exactly 
fixed but can fluctuate around an average value N with 
a certain standard deviation AN. In this general setting, 
we focus on the ultimate precision limit for the estimation 
of a parameter ip of the generic two-input and two-output 
circuit described by a scattering operator Sy, [see Fig. 
1(a)]. Under the assumption of pure input probes, we 
find the exact expression for the quantum Gramer-Rao 
bound Sip > Spniim which sets the limit to the uncer¬ 
tainty Sp in the estimation of p. Specifically, one gets 
[2, 3] 

Sprain = / ; ( 1 ) 
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where v is the number of trials and (tp) is the op¬ 

timal quantum Fisher information (QFI). We show that 

= (|e+ - £_ I + AiV2 + |e+ + e_ | Aiv) " , 

( 2 ) 

where £± are constants encoding the physical properties 
of the generic two-port circuit (as described in Sec. II). 
This is the central result of the present work. In par¬ 
ticular, one notices that for an antisymmetric configu¬ 
ration (i.e., £+ = the above expression predicts 

a scaling l/'/?P~+~AN^ for i5<pmin, generalizing the re¬ 
sults obtained in Refs. [16] and [20] for a Mach-Zehnder 
interferometer. On the other hand, for a symmetric case 
(i.e., £+ = e_), we get a 1/AN scaling. Such simple 
prototype cases are compared in Sec. V. More generally, 
Eq. (2) explicitly shows that the Heisenberg-like scaling 
1 /N for d(/5„iin can be beaten by the presence of number 
fluctuations. Indeed, irrespective of the values of e±, by 
exploiting large number fluctuations AN N, one can 
get a very small estimation error d(/3„iin oc 1 /AN 1 /N. 
The best input state that allows us to achieve the ulti¬ 
mate QFI in Eq. (2) is clarified to be a “quasi-NOON 
state” (see Sec. IVA), or in some special cases a two¬ 
mode “Schrodinger-cat state” with a vacuum component 
(see Sec. IVB). We also identify the best input state 
among pure Gaussian states and see how close we can 
get to the above ultimate precision by a Gaussian state, 
which would be much simpler to generate than the quasi- 
NOON state or the two-mode Schrodinger-cat state with 
vacuum (see Sec. VIB). Remarkably, the best pure Gaus¬ 
sian state is a product state with no entanglement: it is 
simply a single-mode squeezed vacuum. 

This article is organized as follows. In Sec. II, we 
review the basic definitions of scattering operator and 
Fisher information in the context of quantum metrology, 
and set up our problem. In Sec. Ill, we rephrase the 
problem of the optimization of QFI in terms of classical 
probability, and solve it to get the maximal QFI shown in 
Eq. (2). The optimal input state that allows us to achieve 
the maximal QFI is exhibited in Sec. IV, and some spe¬ 
cial cases are considered in Sec. V. We also solve the 
optimization problem within the restricted class of pure 
Gaussian states, which are of interest in quantum optics, 
in Sec. VI. Conclusions are finally summarized in Sec. 
VII. 


II. BASIC SETUP AND FRAMEWORK 



FIG. 1. (a) A generic two-input and two-output linear 

(particle-number preserving) unitary circuit. The ports ± 
are associated respectively with the modes a± iir Eq. (3). 
(b) Mach-Zehnder interferometer as an example of the two- 
input and two-output linear circuit (a). In this special con¬ 
figuration, incoming photons go through the first balanced 
beam splitter described by the unitary scattering operator 

Ubs = acquire a relative optical phase 

ip between the two paths of the interferometer by V), = 
^-i(ip/ 2 )(a^a^-a_a_) ^ pass the last balanced beam split¬ 
ter t/^g. The overall unitary transformation is therefore given 

by Sip = U^qVpUbs = corresponding to 

setting the parameters in Eq. (6) to j3 = (p/2, y = r = 0, and 
p = — 7r/2, which yield the matrix elements in Eq. (3) equal 
to T± = cos((p/2) and R± = ±sin((p/2). The generator Hip 
in Eq. (8) is instead given by H^p = {i/2){a/a+ — a]^a_), cor¬ 
responding to setting the functions in Eq. (9) to A± = 0 and 
B = i/2. In the diagonalized form (10), we have e± = ±1/2. 


shift (p by injecting photons into the input ports [Fig. 
1(b)]. The following analysis however is not restricted 
to such a specific setup, and the parameter ip can be 
something more general. In particular, we describe the 
action of the circuit on the input state by a scattering 
operator Stp preserving the total number of particles. It 
induces the canonical transformation 


S 


V 





(T+ R+\ 

\R- tJ 



(3) 


where are the creation operators for bosons incoming 
to and outgoing from the ports of the circuit labeled “±”, 
and where T± and R± are complex-valued functions of 
ip which define a 2 x 2 unitary matrix by fulfilling the 
constraints 


\t±\^ + \R±\^ = \t±\^ + \Rt\^ = i, 

T^Rpf: + RlT^ = TlR± + = 0 . 


(4) 


A. The Model 

We consider a generic two-input and two-output linear 
(particle-number preserving) unitary circuit [Fig. 1(a)]. 
Our problem is to estimate a single parameter ip of the 
circuit, by injecting bosonic particles into the circuit and 
observing its output. A typical example is the Mach- 
Zehnder interferometer used to measure an optical phase 


When feeding the device with a two-mode input state 
|'0o) (this is the initial state of the probing signal), it 
outputs the state 

IV'v) = -^vIV'o), (5) 

which is the one we can monitor in order to recover the 
value of the parameter p. 
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B. Normal Modes 


C. Fisher Information 


The conditions in Eq. (4) lead to the parameterization 
of the matrix elements 


(7) 


T± = e cos/?, R± = —ie sin/?, (6) 

with /3, Xi Slid P being arbitrary real-valued functions 
of (f. Accordingly, Sip can be expressed as 

— g-ix(a+a++aLa-)g-ir(at,a+-aLa_) 

We then introduce its generator by 

fr - Qt 
^ ~ ^ dip 

= A+o^_a+-I-A_aLa_-I-i?aLa-i-+-B*a+a_, (8) 

where A± and B are the following implicit functions of 

ip-. 


dip 2 \ dip dip 


(9) 


The generator Hip defined as in Eq. (8) plays an impor¬ 
tant role in evaluating the optimal precision of the esti¬ 
mation of the parameter ip later. Since Hip is Hermitian, 
by explicit diagonalization we can write 


Hip = e+c+c+ -I- e_cLc-, 


( 10 ) 


where 


£± = 2 sgn(A+ -t A_) 


X (|A+ + A_| ± v'(4l+ - A_)2 + 4|i?|2) , (11) 

with sgna; = 1 for x > 0 and —1 otherwise, and where 
Ba+ + (£± — A+)a 


c± = 


V(£±-A+)2 + |il|2 


( 12 ) 


are the normal modes satisfying the canonical commuta¬ 
tion relations. They are related to a± by a unitary trans¬ 
formation preserving the total number of probe particles. 


N = a)ga+ -I- d^d- = c)gC+ -I- cLc_. 


(13) 


The simplest strategy for estimating the parameter ip 
of the two-mode linear circuit in Fig. 1(a) is to inject 
a single probe particle into port -|- and see whether it is 
output from port -|- or from port —. We repeat this exper¬ 
iment many times (but a finite number of trials v), collect 
the data, and evaluate the probabilities P+ = P{+\ip) = 
|T+p = cos^/3 and P_ = P{—\ip) = |i?+P = sin^/? for 
the respective possible outcomes. By comparing these 
probabilities with their theoretical predictions, the pa¬ 
rameter ip is estimated. The ultimate precision of this 
estimation can be evaluated on the basis of the Cramer- 
Rao inequality [2, 3, 22-25]: the uncertainty Sip in the 
estimation of ip is bounded as 


dip > 


1 

'JvFi.ip) ’ 


(16) 


where 

(17) 

is the Fisher information (FI) of the procedure. Different 
detection strategies, i.e., not simply checking whether the 
probe particle comes out of either port -I- or port — but, 
for instance, checking whether the output state of the 
probe particle is in a superposition state of the outputs 
from port -|- and port —, might provide us with better 
estimation. The maximum FI attainable by means of an 
optimal measurement is known to be expressed by the 
so-called QFI [2, 3, 24-26], which for the pure output 
state \il)ip) in Eq. (5) can be computed as the variance of 
the generator Hip in Eq. (8) evaluated in the input state 
l'0o), i-e.. 


(18) 


Specifically we have 

Fq{<p) = 4^(1 - = 4(Aff^)^„. 

(19) 

For the above-mentioned strategy, where we input a par¬ 
ticle from port -|-, one has \ipo) = a+|0) and hence 


Notice that Hp and N commute and admit as simulta¬ 
neous eigenvectors the Fock states |m,n) of the normal 
modes c+ and c_ (with m and n being non-negative in¬ 
tegers), the associated spectra being respectively 

E^n^n = e+mP £-n, Nm,n = m + n. (14) 

Notice also that, for our convenience, we ordered the 
eigenvalues £± (and the corresponding normal modes c±) 
such that 

e+ > £-• (15) 


Fq{<p) 


1 

P+P- 



2 


( djr - p) 

\ dp, 


, ( 20 ) 


which is clearly larger than or equal to F{ip) in Eq. (17). 
What is important is the fact that formula (19) can be 
optimized by tuning the input state |'0o) within the set 
£ of states allowed by the constraints we impose on the 
problem [3, 12], i.e., 

(ip) = max FQ{ip). (21) 
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Via the quantum Cramer-Rao inequality [2, 3, 22-25] this 
provides the minimal uncertainty in the estimation of ip 
through Eq. (1). 

Determining for a generic two-mode linear 

circuit is the goal of the present work. In particular, 
we study the scenario where constraints are imposed on 
the total number of particles involved in the procedure. 
Specifically, given N and AN positive constants, we ad¬ 
dress the case where the set £ identifies all pure states 
[27] which have an average number of particles equal to 
N and a variance equal to AiV^, i.e., 

£ = ^N.AN 

= {jV'o) I (V'oliVj^o) = N and (AiV)|„ = AV^}, (22) 

where (AfV)|^ = - (V’ol^lV'o)^- The case 

^N,AN=o corresponds to the fixed-number scenario where 
exactly N particles enter the circuit. 


where Em,n are the eigenvalues (14) of H^, while 

Var[E^,„] = ]E[F^,„] - ]E[F^.n]^ 

G*m,n] — IE[Em,nG^m,n] IE[T7ri,n] lE[f7m,n] 

are the variance and the covariance on Pm,m respectively. 
Similarly, the constraints (22) are expressed as 

E[m -I- n] = V, (28) 

Var[TO + n] = ATV^, (29) 

which have to be obeyed by the probability distribution 
Pm,n- In this way, the variance of in Eq. (26) (and 
hence the QFI) and the constraints (28) and (29) are 
all independent of the phases of the amplitudes Xm,n, 
and the phases are irrelevant to the optimization of the 
QFI. The optimization of the QFI is thus reduced to the 
optimization of the classical probabilities (24). 


III. OPTIMIZATION OF QFI AS A PROBLEM 
OF CLASSICAL PROBABILITY 


A. Fixed Number of Probes 


We notice that the generator H^p in Eq. (10) and the 
particle number operator N in Eq. (13) are both com¬ 
posed of the sum of commuting observables, namely, the 
number operators c\.c±. Therefore, the problem of the 
optimization (21) of the QFI under the constraints (22) 
can be analyzed in terms of classical probabilities, when 
considered in the basis {|m,n)} diagonalizing c^_c+ and 
cLc- simultaneously, with eigenvalues m and n, respec¬ 
tively. Indeed, for a generic pure input state 

l'0o) = X! Xm,n\m,n), (23) 

m,n>0 

we will see that the only relevant parameters for the prob¬ 
lem are the classical probabilities 


Consider first the simple case where we look for the 
maximum of the variance (26), namely, of the QFI in 
Eq. (19), under the constraint that the total number of 
impinging particles is fixed and equal to some integer 
value N with no fluctuation (i.e., £ = £Ar,AAf=o)- This 
implies that the possible input states |'0o) of the problem 
have to be the eigenstates of the total number operator 
N with eigenvalue N, i.e., states of the form (23) char¬ 
acterized by amplitudes Xm,n different from 0 only for 
values of m and n fulfilling the condition m + n = N. 
Obviously, this forces the two variables to be linearly de¬ 
pendent. In particular, this implies that the joint prob¬ 
abilities Pm,ra should collapse to the single variable dis¬ 
tribution Pn = PN-n,n- We Seek for a joint probability 
distribution Pm,n that maximizes the variance (26) under 
such a constraint. Under this condition Eq. (26) becomes 


Pm,n — IX' 


2 

m,n I 5 


E 

m,n>0 


(24) 


the phases of the amplitudes Xm,n being completely irrel¬ 
evant. To see this explicitly, for a generic function Fm,n 
of the integer variables m and n, define 


iAHp)l^ = Var[iVe+ - (e+ - e_)n] = (e+ - S-f Var[n], 

(30) 

so that the problem reduces to finding the probability 
distribution P = {Pn}n=o,...,N that maximizes the vari¬ 
ance Var[n] of a positive integer random variable n with 
values in {0,..., N}. We get 


E[Fm^.,^] — ^ ^ Pm,nPm,m 

m,n>0 

which is the average of Fm,n with respect to the proba¬ 
bility distribution Pm,n- Then it is simple to verify that 
the variance (18) of on \tjjQ) can be written as 

= Var[U^,„] 

= e\ Var[TO] -|- e?_ Var[n] -I- 2e_|_e_ Cov[m, n], 

(26) 


1 X V 1 

max Var[n] = max - PmPni'm — =-N^ (31) 

m,n—0 

at the uniqne point 

Pn = -^5n,0 + 2 ^n,N- (32) 

The optimal state that maximizes the variance of the 
generator F[^ is now clear: it is the snperposition of the 
state belonging to the maximnm eigenvalne jlV, 0) and 
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the state belonging to the minimum eigenvalue |0, 
with an arbitrary phase (j), 

l^o) = INOON) = ^(|iV, 0) + N)). (33) 

This is a so-called NOON state [3, 12, 29]. With such an 
input state the QFI (19) is maximal and reads 

F^“"")(^) = N^{e+ - e.r = N^[{A+ - A.f + 

. 

which corresponds to a minimal uncertainty 6ipmin in Eq. 
(1) exhibiting the typical 1/iV Heisenberg scaling (for 
comparison, observe that the QFI associated with an ar¬ 
bitrary Fock state |— n, n) is always null). Notice also 
that in the symmetric case, e+ = £-, the maximal QFI 
(34) vanishes, implying that in this circumstance the pa¬ 
rameter ip cannot be recovered using states with a fixed 
number of particles, the minimal uncertainty (I) being 
unbounded. 




FIG. 2. (a) The domain on the variance xy-plane (shaded 
region) where maximization of the function = (AH^,)^^ is 
to be performed. The Cauchy-Schwarz inequality (41) is sat¬ 
urated along the parabolic boundary curve (43), while the in¬ 
equalities in Eq. (45) are saturated along the straight line (47) 
connecting C+ and C_ given in Eqs. (48) and (49). (b) The re¬ 
lationship (60) between m and n, when the Cauchy-Schwarz 
inequality (41) is saturated, along the parabolic boundary 
curve in (a). 


B. General Setting 

Consider next the case with nonvanishing /S.N. In 
other words, we do not fix the total number of parti¬ 
cles m -I- n as in the previous section, but let it fluctuate 
around its average N with a variance AN'^. Our task 
is to optimize the QFI, i.e., to optimize {AH^ )lo in Eq. 
(26), under the constraints (28) and (29) on the total 
number of particles. 

We first notice that for the symmetric case with £+ = 
£_ = £ the eigenvalues of the generator are given by 
Em,n = e{m + n), which are proportional to the eigenval¬ 
ues m -|- n of the total number of particles N. Therefore, 
the maximization of {AHip)^^ under the constraint (29) is 

trivial, because {AH^)'^^ = e^AN'^ for all |'0o) € Cat,aw, 
yielding 

F^'"'^\ip) = AN^. (35) 

Let us hence assume that £+ £-. This is always the 


case if H 0 [see Eq. (11)]. 

To simplify the notation we set 

a;:=Var[m], ?/:=Var[n], z := Cov[to, n], (36) 

:= {AH^)1,. (37) 

In this notation, Eqs. (26) and (29) read 

= e\x + e^_y+ 2e+e-Z, (38) 

x + y + 2z = AN‘^, (39) 


with x,y > 0. Using the constraint (39), z is removed 
from the formula for h? in Eq. (38) to yield 


which is a function of x and y. Our problem is therefore 
to maximize in Eq. (40) within the region on the xy- 
plane allowed under the constraints (28) and (39). 

The region on the xy-plane is limited as follows. Eirst, 
the covariance z = Cov[m, n] is bounded by the Cauchy- 
Schwarz inequality |Cov[m,n]| < -y/Var[m] Var[n], i.e., 

|z| < y/xy. (41) 

Combined with Eq. (39) it yields the bounds on x and y, 
—2y/xy <x + y — AN'^ < 2^/xy. (42) 

Its boundary is given by a parabola on the a;?/-plane, 

{x-yf-2AN^{x + y) + AN^ =0, (43) 


which is symmetric with respect to the line x = y and 
tangent to the x and y axes at (AAf^,0) and (0, AAf^), 
respectively. See Fig. 2(a). Another bound on x and y 
is obtained using the inequality between arithmetic and 
geometric means, combined with the constraint (28), i.e.. 


EM E[n] < 


^ eH + iEN y 


N^/A. 


(44) 


Recalling the definition of the covariance and the fact 
that mn > 0 we find 


z = Cov[m, n] = E[mn] — E[m] E[n] 

> - EH E[n] > -N'^/A. (45) 

Combined with Eq. (39) it bounds x and y as 

x + y<N‘^/2 + AN‘^. (46) 




= (£_|_ — £_)^ 


(40) 


Its boundary is a line on the xy-plane, 
x + y = AV 2 +AAf2, 


2 


(47) 
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intersecting the x and y axes at {N'^/2 + AN‘^,0) and 
(0, N^/2 + AN^), respectively. See Fig. 2(a). 

Summarizing, the maximization of in Eq. (40) has to 
be carried out in the domain whose boundary is defined 
by the parabola (43) and the line (47) as shown in Fig. 
2(a). The corner points C± are given by 

= (iVV±/4,iVV^/4), (48) 


which implies, by (37) and (19), 

E^“"")(y,) = (|£+ - e_| v'iV2 + A7V2 + |e+ + 

= ( 1 /(^ 4 + + yl_)2 + 4|B| + AfV2 

+ \A++A.\ANy. 

(56) 


with 


cr± = 1 + 


2AN^ 

iV2 


± 



2AN^ 



(49) 


It is easy to see that, within this domain, does not 
have a stationary point. Indeed, by direct calculation we 
find 


This formula is the central result of the paper. It shows 
the dependence of the maximal QFI on the average N 
and the variance ATV^ of the total number of probe par¬ 
ticles. Moreover, on one hand, if we fix the number of 
particles at N, and thus set AN = 0, we recover the 
Heisenberg limit found in Eq. (34). On the other hand, 
the fluctuation AN enhances the QFI, and for AN ^ N, 
we get 


dh^ 

dx 


£+(£+-£-), 


dh^ 

dy 


-£-(£+-£-), (50) 


which cannot vanish simultaneously since we are assum¬ 
ing £+ ^ Therefore, the maximum of h'^ must be 
searched on the boundary of the domain. Let us hence 
evaluate h? in Eq. (40) along the parabola (43), by re¬ 
moving x-by from h?' in Eq. (40) using the parabola (43), 
to get 


^para “ ^ ^AN'^ ^ “ E-)^ + ^ (^-l- + £-)‘^AN‘^, 

(51) 

where ^ = (a; — y)/2. Due to the convexity of Eq. (51) as 
a function of we have that 


maxhpj,j.g^ = max{h^,/i?_} =/i^, (52) 


where are the values of h'^ at the end points C±. If 
we perform the same analysis for the line defined in Eq. 
(47), we find 

^Une = (£+ “ £-)'f + ~ ^-)^^^ + 2 + £-)AN'^. 

(53) 

If £^ = £?., then is constant along the segment (47) 
between C+ and C-. Otherwise, is maximal at the 
end point C+, i.e., 

max/ifi„3 =(54) 


as in Eq. (52). 

Finally, by inserting Eqs. (48) and (49) into Eq. (51) 
or Eq. (53), we see that in any case (including the case 
£^ = £?-), we have 


max h'^ 
p 




— £_ 






■ £- 


.AN 

IT 


2 


(v?) = 4£2 AiV2, (57) 

independent of the average number of particles N. 


IV. OPTIMAL INPUT STATE 

In this section we are going to determine the input 
state 1 ^ 0 ) that yields the maximum value of the QFI 
(56) obtained in the previous section. 


A. For e_|_ + e_ 7 ^: 0: Quasi-NOON State 

We recall that, when the maximum of in Eq. (55) is 
achieved, the inequalities in Eq. (45) are saturated. This 
implies [recall the condition for the equality between the 
arithmetic and geometric means in (44)] 

E[mn] = 0, E[to] = E[n] = N/2, (58) 

and hence, Cov[m, n] and the constraint (39) are reduced 
to 

Cov[to, n] = —fV^/4, Var[m] -I- Var[n] = fV^/2 -|- AfV^. 

(59) 

When £+-!-£_ 7 ^ 0 the Cauchy-Schwarz inequality (41), 
i.e., |Cov[m,n]| < y^Var]™] Var[n], is also saturated. 
This implies (recall the condition for the saturation of 
the Cauchy-Schwarz inequality) that the random vari¬ 
ables m and n are linearly dependent, 

n = am + P (a < 0 ), (60) 

with a and /3 real numbers. Due to the negative covari¬ 
ance in Eq. (59), the coefficient a is negative, and the 
ranges of m and n are limited: see Fig. 2(b). In addition, 
we know that the maximum of is reached at the end 
point in Eq. (48) [since £^ > see Eq. (52)], i.e., 
when 


(55) 


(Var[m], Var[n]) = (N^a+/4, N^a-/4), (61) 
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where a± are defined in Eq. (49). From these conditions, 
a and (3 are fixed. By inserting Eq. (60) into Eqs. (58) 
and (59), we have 

Na + 2I3 = N, a VarH = -N^/A, (62) 

and, by taking Eq. (61) into account, we get 

a = —(T_, /3 = A^(l + (T_)/2. (63) 

It derives that the ranges of the random variables m and 
n are limited by 

0 < m < iV(l+ cr+)/2, 0 < n < iV(l+ cr_)/2. (64) 

We are now ready to characterize the structure of the 
optimal input state that maximizes for the case £+ + 
£_ ^ 0. Due to the linear dependence (60), the random 
variables m and n are perfectly correlated, and we have 
only to look for a probability distribution Pm for the 
variable m. The first condition in Eq. (58) implies mn = 
0, and, therefore, either m or n should vanish. When 
n = 0 we have m = A^(l + (T+)/2. Thus, the solution is 

Pm = A(5m,Ar(l+cr+)/2 + (1 ~ ^)<5m,0 (0 < A < 1). (65) 

This probability distribution should satisfy the condi¬ 
tions in Eqs. (58) and (61), i.e., 


EN = XN{1 + a+)/2 = N/2, 

Var[?7i] = A(1 — A)A^^(1 -I- cr+)^/4 = iV^cr+/4, 


( 66 ) 


which are compatible with each other with the choice 


A = 


1 

1 + 17 + 


(67) 


Therefore, the optimal input state takes the form 


y 1 -k cr+ 

+ ^Y^P‘^\0,N{l + a-)/2), ( 68 ) 

where the phase (f) is arbitrary. This state is a defor¬ 
mation of a NOON state with different weights {quasi- 
NOON state). In the limit of vanishing fluctuation 
AN/N —>■ 0, we have a± —>■ 1, and the state (68) re¬ 
duces to the NOON state (33). 

Notice that the variables m and n are integers while 
they should take the values m = N{1 + a+)/2 and 
n = N{1 + a-)/2 in the state (68), which are in general 
nonintegers. Precisely speaking, the true optimal state is 
different from the one given in Eq. (68) in such a generic 
case, and the QFI given in Eq. (56) is not reachable. The 
error, however, is negligibly small, of order 1/iV, when N 
is large. (Moreover, this problem can be removed with a 
proper choice of the ratio AN/N.) 


Note also that the optimal state (68) is a quasi-NOON 
state defined in the basis diagonalizing the generator H^p 
as Eq. (10), i.e., in the normal modes c±, not in the phys¬ 
ical modes a±. Since the unitary transformation relating 
the physical modes a± to the normal modes c± is a uni¬ 
tary transformation preserving the total number of par¬ 
ticles as Eq. (13), such a transformation can in practice 
be realized by a linear circuit composed of beam split¬ 
ters and phase shifters. Therefore, the optimal state |^o) 
given in Eq. (68) to be injected into the target circuit can 
be generated from the quasi-NOON state prepared in the 
physical modes a± by sending it through an appropriate 
linear circuit realizing the diagonalization of the gener¬ 
ator Hip. Recall the formula for the QFI in Eq. (19) 
with Eq. (18), and consider the diagonalization of the 
generator Hp in that expression, to understand that the 
change of basis for the generator Hp induces the unitary 
transformation on the input state |V’o)- 

It is worth mentioning that in order to construct the 
optimal state one needs some knowledge on the parame¬ 
ter ip, which can be approximately learned via the follow¬ 
ing strategy: an initial small fraction of probes is used 
to acquire preliminary information on the value of ip in 
order to prepare the optimal input state for the subse¬ 
quent probes. This will add an extra cost to the proce¬ 
dure that, however, will be asymptotically negligible for 
large enough N. 


B. For e_|_ + e_ = 0: Two-Mode Schrodinger-Cat 
State with Vacuum 


In the case of an antisymmetric scatterer with e+ + 
£_ = 0 [e.g., the Mach-Zehnder interferometer in Fig. 
1(b), for which we have e± = ±1/2], the maximum of 
in Eq. (55) is reached anywhere on the segment with end 
points C+ and C- in Fig. 2(a). In this case the inequal¬ 
ities in Eq. (45) are saturated, and the conditions (58) 
and (59) are satisfied, but the Cauchy-Schwarz inequal¬ 
ity (41) is not necessarily saturated. We do not have 
the perfect correlation (60) between m and n in general. 
Still, we have a strong constraint, i.e., the first one in 
Eq. (58), which implies that either m or n should van¬ 
ish. Therefore, the relevant probability distribution that 
maximizes for e+ ± e_ = 0 is given in the form 

Pm,n ~ f^^n.oPm ± (f P)^m,0Pn (0 ± /T ± 1), (60) 

where Pm and are normalized probability distribu¬ 
tions. This probability Pm,n should satisfy the conditions 
in Eqs. (58) and (59), i.e., 


EH = E[n] = N/2, 

E[to^] ± EK] = iV^ ± 


( 70 ) 







namely, 


' mpm = (1 - = N/2, 

m n 

< 

p.m^pm + (1 - At) n^Pn = iv^ + ATV^. 

^771 n 


(71) 


There are infinitely many triples {PrmPmPi) satisfying 
these conditions, among which we have the above two 
explicit examples (65), i.e., Pm = i5m.Ar(i+(T+)/2 a-nd 
Pn = (5n,Ar(i+cr_)/2 with p = l/v^i + a+. That is, the 
quasi-NOON states in Eq. (68) give the optimal QFI 
also in this case. Another nontrivial example can be con¬ 
structed with Poissonian distributions 




1 

( ^ 

Pm — 

to! 

V2At+ 


1 I 

( A ' 

Pn = 

n! 

K^P-, 




1± 


AN^ - N \ 

AV 2 + AiV 2 _ M j ’ 


P = P+, 


(72) 

(73) 

(74) 


where the probability amplitude associated with the vac¬ 
uum term is given by 

Co = VpPo + (1 - p)Po 

- v'Ate-^/2M+ - Y^(l - At)e-"/2M- (79) 

while the amplitudes a and d of the coherent states |a, 0) 
and |0, d) are 



It is worth observing that the vector (78) is a two-mode 
Schrodinger-cat state with three components, one being 
the vacuum and the other two having a mean photon 
number |ap in mode “-I-” and a mean photon number 
jap in mode ” both of which scale linearly with TV. 
Note again that the “modes” here are not the physical 
modes a± but the normal modes c±. 


V. SPECIAL CASES 


as long as AN^ > N. The set of probability distributions 
{Pm,n} satisfying the above conditions in Eq. (71) forms 
a convex set: any convex combinations of valid probabil¬ 
ity distributions Pm,n and P^ „ are also valid probability 
distributions, since the conditions in Eq. (70) are linear 
In Pjn^n- 

Each probability distribution (69) can be arranged as 

Pm^n — ^m,0 ^nfl[pPoP{^ — p)Po\p p5nfiQmP{^ — p)5mfiQn, 

(75) 

where Qm = Pm - Pa^mp and Qn = Pn - Po^np, and the 
corresponding pure state reads 


IV'o) = VpPo + (1 - p)Po\0,0) 

+ X! VPPm 

m>0 

+ (76) 

n >0 

with arbitrary phases (f>m and (j)n- As a special instance of 
such optimal states, it is worth considering the case with 
the Poissonian distributions (72)-(74) with the phases 
4>m and fpn having linear dependences on their indices, 
i.e., 

4>^ = mA(j) + (j)Q, (j)n = nA(j) + ^o, (77) 

where A^, A(p, and (po are arbitrary constants. Under 
this assumption, Eq. (76) can be expressed as a coherent 
superposition of three two-mode coherent states, 

l^o) =co|0,0) + y^e*^'>|a,0) +v^r^e*-^°|0,d), (78) 


In this section we discuss explicitly three paradigmatic 
examples of scattering processes, relevant to our previous 
discussion. 

The first setting is an example of an asymmetric con¬ 
figuration: see Fig. 3(i). It is equivalent to the Mach- 
Zehnder interferometer in Fig. 1(b) (but with <a>/ 2 re¬ 
placed by if ), apart from the first and the last beam split¬ 
ters, which are both irrelevant to the QFI and are used to 
change the input state and the measurement procedure. 
In this scheme the probes in the two input modes expe¬ 
rience opposite phase shifts. Accordingly, the generator 
Hip (i.e., a± = c±) is already in the normal form (10) 
with (e+,e_) = (1,-1) yielding 

= 4(A^^ + AA^) 

obtained by either quasi-NOON input states (68) or 
(when AA^ > A) the vectors (76). 

The second setting also involves two phase shifters, but 
in the parallel configuration: see Fig. 3(ii). This is an 


(i) (ii) (iii) 



FIG. 3. Three paradigmatic settings for the estimation of a 
phase shift ip: (i) antisymmetric configuration; (ii) symmetric 
configuration; (iii) unbalanced configuration. In all cases the 
input modes a± coincide with the normal modes c± of the 
system. 
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example of a symmetric setup where the generator is 
again in the normal form (10) with now e_) = (1,1). 
Accordingly, we get 

( 82 ) 

the optimal state being any state fulfilling the particle 
variance constraint. 

Finally, the last setting is an example of an unbalanced 
scatterer, where the probe in one arm does not encounter 
a phase shifter and just acts as a reference: see Fig. 3(iii). 
Also in this case H^p is in the normal form (10) with 
(e+,e_) = (1,0) yielding 


= (\/A^2 + + AfV)2 




+ AA2 + AN) ’ 


(83) 


the optimal probes being the quasi-NOON input states 
in Eq. (68). 

It is interesting to observe that, for fixed values of the 
constraints N and AN, the antisymmetric configuration 
(i) yields the best estimation uncertainty Sipmin, the un¬ 
balanced configuration (iii) gives the second best uncer¬ 
tainty, while the parallel configuration (ii) is the worst of 
the three. 


VI. OPTIMAL GAUSSIAN INPUT STATES 

In the previous section, we have identified the optimal 
input states that allow us to achieve the optimal QFI in 
Eq. (56), for the generic two-mode linear circuits, with a 
given average and variance of the total number of probe 
bosons. In particular, the quasi-NOON state (68) is the 
optimal choice if e+ ^ £-■ If £+ = —£- and ATV^ > N, 
the Schrodinger-cat state with vacuum in Eq. (78) is one 
among infinitely many optimal states. However, generat¬ 
ing such exotic states might be very challenging in prac¬ 
tice, even in quantum optical implementations [30-38]. 
On the contrary, Gaussian states, including entangled 
states such as two-mode squeezed states, appear to be 
much easier to produce, and to some extent are readily 
available in the laboratory [39, 40]: identifying the opti¬ 
mal Gaussian input states which provide the best perfor¬ 
mance in our setting hence appears to be an interesting 
question. 

Quantum metrology using Gaussian states of light, in 
particular with two-mode interferometric setups, has also 
been eagerly studied in the literature [13, 15, 16, 18, 19, 
21]. We notice here that since the quasi-NOON states 
(68) and the Schrodinger-cat states (78) are not Gaussian 
states, the optimal QEI in Eq. (56) cannot be reached by 
a Gaussian state in general. How much is the gap be¬ 
tween the ultimate QFI in Eq. (56) and the one attainable 
by the best Gaussian state? What is the structure of the 
best Gaussian state? These are the questions we address 


in this section. Eor this purpose, however, we observe 
that in general it is not easy to variate the parameters 
characterizing Gaussian states so that the constraints on 
the average N and the variance AN"^ of the number of 
probe particles are always satisfied simultaneously. On 
the other hand, due to the Gaussianity of the probe state, 
fixing the average at N is enough to get a hnite optimal 
QEI. We therefore search for the optimal input state 
maximizing the QEI for the estimation of a parameter 
of the generic two-mode linear circuit in Fig. 1(a), among 
the Gaussian states fulfilling only the constraint on the 
average number N of probe particles. 


A. QFI with a Gaussian Input State 


Each Gaussian state is characterized by the covariance 
matrix F defined as 

Tii = + Zjh) - (5i)(%) {i,j = 1,... ,4), (84) 

with (■ ■ ■) denoting the expectation value in the Gaus¬ 
sian state [39, 40], where we have used the quadrature 
operators 



x± 


a± + 


y± 


a± — 


(85) 


and the displacement d, whose components are defined 
as 


di = {zi) (z = l,..., 4 ). (86) 


We focus on pure input states, as in the optimization 
problem studied in the previous sections. In this case all 
the eigenvalues of F (obtained using Williamson’s theo¬ 
rem [41]) are equal to 1/2, and F can always be decom¬ 
posed as [39, 40] 


F = ^RQ^R^, 


(87) 


where Q = diag(e’'+, e’'“, e“’'+, ) is the (diagonal) 

squeezing matrix and i? is a symplectic and orthogonal 
(thus, unitary) matrix. Due to its symplectic structure, 
R can be written as 


R = W'‘ 


u 

0 

0 

u* 


W, 




( 88 ) 

where [/ is a 2x2 unitary matrix (U* being its conjugate), 
IF is a 4 X 4 unitary matrix and I 2 is the 2x2 identity 
matrix. 

When injecting a pure Gaussian state characterized 
by a covariance matrix F and a displacement d into the 
two-mode linear circuit in Fig. 1(a) the associated linear 
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transformation (3) maps it into a new Guassian state 
with a covariance matrix and displacement given by 


= R^pTR^, = R^d, (89) 


along the lines of Ref. [42], yielding 

2' 


FGiv>) = ^Tr- 




dip 


ddl dd^ 

dip ^ dip 


= - Tv{H^T-^H^T - H^} + d^ H^T-^H^d. 


(91) 


where is the unitary ((p-dependent) matrix defined by in this expression the 4x4 matrix is the generator 

of Rip, which, in a properly chosen basis, is given by 


Rp = W^ 


Up 

0 

0 

u; 


W, 


(90) 


where U* is the 2x2 matrix in Eq. (3) and W is defined 
as in Eq. (88). 

The QFI for the pure Gaussian state with a covariance 
matrix E,^ and a displacement d^ can then be computed 

I 


Hp=zRl^ = W^ 


dip 


e 

0 

0 

-£ 


W, £ = 


^e+ 0 ^ 
t 0 £_/ 


(92) 

with e± as in Eq. (11): the dependence of Fciip) upon 
the parameter ip is once more entirely encoded in these 
functions. We find it convenient to rewrite Eq. (91) as 




(93) 


with 


F(p {}p) = Tr{C/^e[/ sinh2r (U^eU)* sinh2r + {U^eU cosh2r)^ — e^}, 

F^Q\ip) = 2 f atec/ I eU* ) (— 

^ ' 2 \ sir 


cosh 2r 

sinh2r \ 

( [/t£a 

sinh 2r 

cosh2r J 

y U^ea* 


(94) 


where we have defined the 2x2 diagonal matri¬ 

ces cosh2r = diag(cosh2r+, cosh2r_), sinh2r = 
diag(sinh2r+,sinh2r’_) and the vector 

\a_/ \{a-)/ \{x-)+i{y-)/ 

The hrst contribution FQ\ip) to the QFI can be explic¬ 

itly evaluated parametrizing the 2x2 unitary matrix U 
in Eq. (88) as 

/e-i(x+d)cos| sin 

t/ = e-5W M. (96) 

\e-f (x-0) sin I e5(x+'^) cos | / 

A direct calculation hence shows that 
FcHip) = 2 -I- i(5£cos0^ sinh^ 2r+ 

-I-2 —-decos0^ sinh^ 2r_ 

-I- (de)^ sin^ 9 [sinh^(r_|. -|- r_) 

— sin^ X sinh 2r_|_ sinh 2r_ ], 
(97) 


where 

£ = , fc = £+ — £_. (98) 


B. Optimization of the Pure Gaussian Input State 

We optimize the QFI in Eq. (93), given Eqs. (94) and 
(97), under the constraint on the average of the number 
of probe particles N defined in Eq. (13). In the case of a 
pure Gaussian state, it can be specialized as 

(N) = sinh^ r+ -|- sinh^ r_ -|- ||a|p = N (99) 

with 

||af = ata = |a+|2 + |a_|2. (iqq) 

Without loss of generality, we take 

r+ > r_ > 0 (101) 

[if we want other configurations we can just rotate the 
squeezing matrix Q by i? in E in Eq. (87)]. Since only 
the norm ||a|p is relevant to the constraint (99), we can 
freely tune the “direction” of a. Let us first optimize 
the second contribution Fq ^ (ip) to the QFI in Eq. (94) 
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by making use of this freedom. The upper bound can be 
evaluated using the largest eigenvalue of the matrix in 
the definition of Fq\lp) in Eq. (94). We obtain 

F^\^p) <A\\eoL\\'^e^^+. ( 102 ) 


Setting X = 0, Fq\(p) is a convex parabolic function of 
cos 0 and reaches its maximum at either end of the range 
of COS0, i.e., at cos 9 = ±1. The optimal choice is 

0 = 0, (106) 


This bound can be made more helpful taking into account 
that 


llsaf =4|a+p+e2_|a_p 


+ (4-£-) 


l«+P-l«-P 


<4l|«f. (103) 


Finally we find 


and a direct calculation shows that 

max Fq\ip) = 2(e^ sinh^ 2r+ + sinh^ 2r_), (107) 

irrespective of x- The optimal value of 9 for Fq\(p) 
in Eq. (106) are compatible with those obtained for the 
maximization of Fq (ip) in Eq. (105). 

Finally, optimizing also with respect to n = Q:/||a||, 
the direction of a, we get 


F^\ip) < 44||afe2'-+. 


(104) 


This bound can be saturated by tuning the parameters 
of the unitary matrix U in Eq. (96) characterizing the 
input Gaussian state so that 


a = illalle 2 (’)+x+ 0 ) | and 0 = 0 (e+> £?_), 



a = illalle 2(’?+x) [ _ (£+=£?_). 

(105) 

We turn now to the maximization of Fq^ ((p). We notice 
that the parameters 0 and x in Eq. (97) are also irrelevant 
to the constraint (99). We next tune them to optimize 
the first contribution F^^ip) to the QFI in Eq. (97). 


max Fo(<p) 

0, n 

= 2[£4sinh^ 2r+ + 2||Q:|pe^’’+) + £?_ sinh^ 2r_], (108) 
with 

a = and 0 = 0. (109) 

The final step is the maximization of the QFI with re¬ 
spect to the squeezing parameters {r_|_, r_} and the norm 
II a IP, included in the constraint (99). In order to fulfill 
this condition, we insert 


sinh^ T- = N — sinh^ — ||a|p 
into Eq. (108). It yields 


( 110 ) 


maxFG(</j) = 8£4inh^+sinh^ r+)-|-4£i||a|pe^'’+ -I-8£?_(IV — sinh^ r+ — ||a|p)(l-|-IV — sinh^ r_|_ — ||a|P). (HI) 

6, n 


The optimization must take into account the positivity of the right-hand side of Eq. (110) for a given ||Q:|p. The 
maximum is actually reached with the largest possible r+, namely, with r_ = 0. We obtain 


• =4£+ 

given ||q:||^ 


2(iV-||af)(l + iV-||af) 


Il2 f 1 




This is a monotonically decreasing function of ||q:|P, and 
reaches its maximum at ||ck|P = 0, that is, 

= 8£2 iV(iV -h 1). (113) 

This is the maximal value of the QFI reachable using 
a pure Gaussian state as the input. In particular, the 
optimal input state is 

\^g) = |r)+ ® |0)_, r = i ln(l + 2iV + 2\/fV(iV + I)) , 

(114) 


( 112 ) 


where |r) is a squeezed vacuum state. 

Remarkably, the optimal Gaussian states in Eq. (114) 
are product states with no entanglement. Moreover, it is 
suggested to inject all the available energy into a single 
mode. The result also shows that squeezing is more ef¬ 
fective than displacing in improving the precision of the 
estimation with Gaussian light (in contrast to the claim 
in Ref. [43]) (note that the actual state to be injected into 
the two ports of the given target circuit is not simply the 
squeezed vacuum in one port a+ with the vacuum in the 
other a_, but can be generated from it by transforming 
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(a) (b) 

[xl0‘] 




FIG. 4. (a) The optimal QFI in Eq. (113) with the Gaussian input state (114) compared with the maximal value of 

the QFI in Eq. (56) with AN"^ = 2A^(A'^ + 1), i.e., in Eq. (116), for the three simple cases (i), (ii), and (hi) 

in Fig. 3 with (e+,£_) = (1, —1), (1,1), and (1,0), respectively, (b) The gaps between F^^’^\ip) and for (i)-(iii). 


it through an appropriate linear circuit representing the 
unitary transformation relating the physical modes a± to 
the normal modes c±). 

In the optimal Gaussian states (114), the variance of 
the number of probe particles N in Eq. (13) is given by 

{ANfc = 2N{N + 1). (115) 

Therefore, the above result Fq”“^(i^) in Eq. (113) with 
the optimal Gaussian state should be compared with the 
maximal value of the QFI (ip) in Eq. (56) with the 

variance AiV^ fixed at the value given by Eq. (115), i.e., 

= (|£+ - e.WN{3N + 2) 

+ \e+ + e-\y/2N{N + l)y. (116) 

As expected, in the general case there is a gap between 
and it increases with N and ap¬ 

proaches 

|£:+ — e-|^|£+ — e-|-|- 2(-\/6 — 2)|e+ -h £- 
^■\/3 |e+ — 1 + |£+ -|- £_ 1^ 

(117) 

in the asymptotic limit A —>■ oo (see Fig. 4). We notice 
also that the gap closes exactly when £+ = £_: indeed 
we already knew that any input states with average N 
and variance AA^ achieves the maximal QFI 
in Eq. (56). 


VII. CONCLUSIONS 

In this article we have addressed the problem of the 
optimization of QFI when identical bosonic particles are 
used as probes. We are in particular interested in the case 
where the number of probe particles is not fixed but can 
fluctuate. The optimization problem can be rephrased 
in terms of a problem of classical probability, and we 
have solved it for the most generic two-mode linear cir¬ 
cuit. We have found a concise expression for the maximal 
QFI, which tells us how the fluctuation in the number of 
probe particles enhances the precision of the estimation, 
and we have characterized the best input state, which is 
a generalization of the NOON state, or a generalization 
of the two-mode Schrodinger-cat state, depending on the 
properties of the circuit. We have also identified the best 
input state among pure Gaussian states, which are ubiq¬ 
uitously used in quantum optical metrology. The setup is 
quite general, includes the standard Mach-Zehnder inter¬ 
ferometer and other linear optical circuits, and is relevant 
for various practical applications. 
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